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Abstract 

Sub-additive and super-additive inequalities for concave and convex functions have 
been generalized to the case of matrices by several authors over a period of time. 
These lead to some interesting inequalities for matrices, which in some cases coincide 
with, and in other cases are at variance with the corresponding inequalities for real 
numbers. We survey some of these matrix inequalities and do further investigations 
into these. 

We introduce the novel notion of dominated majorization between the spectra of 
two Hermitian matrices B and C, dominated by a third Hermitian matrix A. Based 
on an explicit formula for the gradient of the sum of the k largest eigenvalues of a 
Hermitian matrix, we show that under certain conditions dominated majorization 
reduces to a linear majorization-like relation between the diagonal elements of B 
and C in a certain basis. We use this notion as a tool to give new, elementary proofs 
for the sub-additivity inequality for non-negative concave functions first proved by 
Bourin and Uchiyama and the corresponding super-additivity inequality for non- 
negative convex functions first proven by Kosem. 

Finally, we present counterexamples to some conjectures that Ando's inequality 
for operator convex functions could more generally hold, e.g. for ordinary convex, 
non-negative functions. 
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1 Introduction 



Two of the basic properties that a real- valued function f{x) defined over the 
reals can possess are sub-additivity and super-additivity. Sub-additivity means 
that for all x, y in the domain of /, 

fix + y)<fix) + f{y), 

while super-additivity means the opposite 

fix) + fiy)<fix + y). 

Two classical theorems that characterise sub- and super-additivity for func- 
tions defined on M"*" (although not completely) are presented as Theorem 7.2.4 
and 7.2.5 in [12]. Their Theorem 7.2.4 states that functions / for which f{t)/t 
is decreasing in M4. are subadditive. Theorem 7.2.5 in [12j states that any 
measurable concave function / is subadditive in ]R_|_ iff /(0+) > 0. 

In recent years, ongoing effort has been spent to characterise matrix functions 
exhibiting similar sub-additivity or super-additivity properties. Of course, 
many variations on this theme are possible, and in this paper we restrict 
attention to sub- and super-additivity in norm for non-negative functions. For 
a given unitarily invariant norm 1 1 1 ■ 1 1 1 , these amount to the norm inequalities 
|||/(A) + f{B)\\\ < \\\f{A + B)\\\ (or reversed), with positive semidefinite A 
and B, but one can equally well consider the inequality |||/(A) — /(-B)||| < 
|||/(|y4 — i?|)||| (or reversed). Historically, these inequalities have been proven 
first for operator monotone, and/or operator concave functions /, and only 
later have they been generalised to non-negative functions that are concave 
and/or convex. Interestingly, the proofs of these generalisations exploit the 
corresponding results for operator monotone/concave functions. 

In this paper we first give a historical overview of these developments, in 
Sections [3] and HI Then we resolve a number of still open questions regard- 
ing the inequality \\\f{\A - B\)\\\ < |||/(A) - /(5)|||, which is known to be 
true for operator convex functions. We show by counterexample that it does 
not hold in general for non-negative convex functions, nor do a number of 
successively weakened versions. By imposing the condition A > ||i?||oo, we 
obtain the closest match of this inequality that does hold for convex functions 
(or in reversed sense for concave functions), namely the eigenvalue inequality 
XiifiA - B)) < \iU{A) - fiB)), for all k. 

In Section [6l we present a new and elementary proof of a sub-additivity norm 
inequality for non-negative concave functions and a super-additivity norm in- 
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equality for non-negative convex functions, that do not rely on the correspond- 
ing inequality for operator monotone/convex functions, nor on the theory of 
operator monotone functions. The proof exploits the novel notion of domi- 
nated majorization between the spectra of two Hermitian matrices B and C, 
dominated by a third Hermitian matrix A. Based on an explicit formula for 
the gradient of the sum of the k largest eigenvalues of a Hermitian matrix, we 
show that under certain conditions this dominated majorization reduces to a 
linear majorization-like relation between the diagonal elements of B and C in 
a certain basis. This is explained in full detail in Section |S] (with one of the 
proofs postponed to Section [7]). 



2 Preliminaries 

In this section, we introduce the notations and necessary prerequisites; a more 
detailed exposition can be found, e.g. in [7]. 

Throughout, M„ shall denote the set of ra x n complex matrices and shall 
denote the set of all Hermitian matrices in M„. We shall abbreviate the terms 
positive semidefinite and positive definite by PSD and PD, respectively. By 
A> B,we mean that A — B > 0. Let / be an interval in M. We shall denote by 
M^(J), the set of all Hermitian matrices in M„ whose spectrum is contained 
in the interval I. 

We denote the identity matrix by I, and use the shorthand a = al for scalar 
matrices. 

We denote the absolute value by | ■ |, both for scalars and for matrices. For 
matrices this is defined as \A\ := {A*AY^'^. Similarly, we denote the positive 
part of a real scalar or Hermitian matrix by (■)_!_, and define it by A^ : = 
(A+ 1 A|)/2. We denote the vector of diagonal entries of a matrix A by Diag(A). 
We will use the abbreviations LHS and RHS for left-hand side and right-hand 
side, respectively. 

Let A G M^(/) have the spectral decomposition 

A = f/*diag(Ai,A2,...,A0t/ 

where f/ is a unitary matrix and Ai, A2, . . . , A„ are the eigenvalues of A. Let / 
be a real valued function defined on /. Then f{A) is defined by 

fiA) = f/*diag(/(Ai), /(A2), . . . , /(A„))f/. 

Let n G N be arbitrary but fixed. The function / is called matrix monotone 
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of order n on I li 

A>B ^ f\A)>f{B) 
for all A, B E M,^(/), and matrix convex of order n on I if 

f{aA + (1 - a)B) < af{A) + (1 - a)f{B) 

for all < a < 1 and A,Be M^(/). Likewise, / is called matrix concave of 
order n on I if —f is matrix convex of order n on /. If the function / is matrix 
monotone of all orders n on I then / is called operator monotone on I. The 
operator convexity and operator concavity are defined similarly. 

A norm 1 1 1 ■ 1 1 1 on M„ is called unitarily invariant (UI) or symmetric if 

\\\UAV\\\ = \\\A\\\ 

for all A G M„ and for all unitary U,V E M„. The most basic unitarily 
invariant norms are the Ky Fan norms 1 1 ■ | |(fc), (/c = 1, 2, ■ ■ ■ , n), defined as 

k 

Pll(fc) =E^i(^)' {k = l,2,---,n) 



and the Schatten p-norms defined as 



1 < p < oo, where (Xi > (T2 > • • ■ > o"„ are the singular values of A G M„, 
that is, the eigenvalues of \A\. The spectral norm (or operator norm) is given 
by ||v4||oo = Si{A) = limp^oo ||^| 



The famous Ky Fan dominance theorem states that a matrix B dominates 
another matrix A in all UI norms if and only if it does so in all Ky Fan norms. 
The latter set of relations can be written as a weak majorization relation 
between the vectors of singular values of A and B: 

k k 

a\A) a\B) : J] a, (A) < J] 1 < A; < n. 

i=i i=i 

For PSD matrices, the above domination relation translates to a weak ma- 
jorization between the vectors of eigenvalues: \^{A) -<w \^{B). Here, \^{A) 
denotes the (real) vector of eigenvalues of A sorted in non- increasing order. 

Weyl's monotonicity theorem ([7J, Corollary III. 2. 3) states that 

\i{A)<\i{A + B), l<k<n, 

for Hermitian A and PSD B. 
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Finally, we refer the reader to Chapter 2 of [H] for an exposition of a number 
of important functional analytic properties of eigenvalues and corresponding 
eigenspaces of a Hermitian matrix, which we will need in the proof of Theorem 

m 



3 Comparison of norms \\\f{A) + f{B)\\\ and \\\f{A + B)\\\ 

For PD matrices A, B, McCarthy [19j proved that 

P'^ + ^nii < + l<r<oo 

and 

\\A' + B'\U>\\{A + BY\\i, 0<r<l. 

Bhatia and Kittaneh |H] proved the above-mentioned inequalities for the op- 
erator norm. There they also proved that 

+ < \ \\{A + B)"'\\\, m = 1,2,... 

for A,B > and conjectured that if / is operator monotone function on [0, oo) 
with /(O) = then 

\\\f{A + B)\\\<\\\f{A) + f{B)\\\. (1) 

Hiai also posed this conjecture in [H]. Ando and Zhan affirmatively settled 
this conjecture in [2]. As a corollary they obtained that if / is an increasing 
function on [0, oo) with /(O) = 0, f{oo) = oo and if the inverse function of / 
is operator monotone then 

\\\f{A + B)\\\>\\\f{A) + m\\\. (2) 

Since the inverse function of a non-negative operator convex function on [0, oo) 
with /(O) = is operator monotone [1], we conclude that inequality holds 
for any operator convex function on [0, oo) with /(O) = 0. In [5] it was shown 
that if the non-negative functions /, g on [0, oo) satisfy inequality (|2]) then the 
functions f + g, fog and fg also satisfy ([2]). It was further shown that any 
polynomial p with non-negative coefficients and p{0) = satisfy ([2]). 

This prompted the authors to conjecture in [5] that any non-negative con- 
vex function on [0, oo) with /(O) = should also satisfy Note that such 
functions must automatically be increasing functions. Using the fact that a 
non-negative convex function on [0, oo) with /(O) = can be approximated 
uniformly on a finite interval by a positive linear combination of angle func- 
tions, Kosem settled this conjecture affirmatively in [T7]. Later on Bourin and 
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Uchiyama proved ([10]; see also jl]) that any non- negative concave function 
on [0, oo) (again such functions must be increasing) satisfies ([1]). 



It is shown in [3f5] that if a non-negative function / satisfies ([T]) then it is 
concave and if it satisfies ([2]) then it is convex with /(O) = 0. Hence within 
the set of non-negative / these resuhs give a full characterisation of all possible 
/ satisfying these inequalities. This completes our discussion in this section. 



4 Comparison of norms \\\f{A) - f{B)\\\ and \ \\f{\A-B\)\\\ 

We begin this section with the inequality of Powers and St0rmer [5T] , derived 
in the course of their work on free states of the canonical anti-commutation 
relations. They proved that if A, B are PSD then 

Kittaneh [15] generalized this to show that 

\\A^I'-By'^\l<\\A-B\\, 
for 1 < J9 < oo. Note that for any matrix T, we have = ||T*T||p, 1 < 



p < oo, so this result of Kittaneh can be restated as 

\\{A-B)%< 



Bhatia [6] proved this inequality for all unitarily invariant norms. There he 
also proved that 

\\{A-Bf\\,<W-B'%, k = l,2,.... 
The above inequality when specialized to the p— norms gives 

for all integers m of the form 2'^, k = 1,2, . . ., which is an interesting general- 
isation of the Powers-St0rmer inequality. 

In [H] Birman, Koplienko and Solomyak proved that 

P'-^nioo < II iA-5nioo, o<r<i 

for all A,B > 0. Note that the function f{x) = is operator monotone on 
[0,oo). This motivated Kattaneh and Kosaki to prove that if / is non- 
negative operator monotone on [0, oo) then 

\\f{A) - f{B)\\^ < f{\\A- B\U = \\fi\A- B\)\U 
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for all A,B > 0. Then Ando [T] proved that if / is non-negative operator 
monotone on [0, oo) then 



|||/(A)-/(i?)|||<|||/(|A-i?|)||| (3) 

A,B > 0, for all unitarily invariant norms. As a corollary to this result, Ando 
deduced that the reverse inequality holds for all functions / on [0, oo) with 
/(O) = and /(oo) = oo if the inverse function of / is operator monotone. 
Since the inverse function of a non-negative operator convex function on [0, oo) 
with /(O) = is operator monotone [1], we conclude that if / is operator 
convex on [0, oo) with /(O) = then we have 

|||/(A)-/(i?)|||>|||/(|A-i?|)|||. (4) 



Afterward, Mathias [18] proved that the inequality (j3j) holds for any non- 
negative matrix monotone function of order n on [0,oo). One may wonder 
whether, in a similar vein, inequality dl]) can be proved for a non-negative 
increasing matrix convex function / of order n on [0, oo) with /(O) = 0. 

We have seen that inequality ([1]) holds for non-negative increasing concave 
functions on [0, oo) and inequality (|2]) holds for non-negative increasing convex 
functions on [0, oo) with /(O) = 0. In the same spirit, we consider the question 
whether inequalities and dl]) can also be generalized to non-negative con- 
cave and convex functions respectively. We raise and answer several questions 
in this direction. 

Question 1 For all A, B, > 0, for all UI norms, and for non-negative in- 
creasing convex functions g on [0, oo) with g{0) = 0, does the inequality 
\MA)-giB)\\\>\\\g{\A-B\)\\\ hold? 

The answer to this question is negative, as shown by the following counterex- 
ample. We consider the convex angle function g{x) = x -\- {x — 1)+ and the 
operator norm. For the 2x2 PSD matrices 



A = 



^0.9 o\ 


f 0.8 0.5\ 






^ 0.6 y 


\^0.5 0.4 J 



the eigenvalues of g{\A — B\) are 0.65249 and 0.35249, while those of g{A) — 
g{B) are 0.65010 and -0.48862. Thus, \\g{\A - B\)\\^ = 0.65249, which is 
larger than \ \g{A) - giB)\\^ = 0.65010. □ 

Under the additional restriction A> B, the absolute value in the argument of 
g in the RHS vanishes, leading to a simplified statement and a second question, 
with better hopes for success. Introducing the matrix A = A — B, 
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Question 2 For all B, A > 0, for all UI norms, and for non-negative in- 
creasing convex functions g on [0, oo) with g{0) = 0, does the inequality 
|||^?(S + A)-^?(S)|||>|||^?(A)||| hold? 

This restricted case also turns out to have a negative answer. Counterexam- 
ples, however, were much harder to find, and required a reduction of the prob- 
lem based on certain results about a novel majorization-like relation, which 
we call dominated majorization. This will be the subject of Sections [5] and O 
where a number of results of independent interest are proven. 

It is also very reasonable to ask: 

Question 3 For all B,A>0, for all UI norms, and for non-negative increas- 
ing concave functions f on [0, oo), does the inequality \ \\f{B + A) — f{B)\\ \ < 
|||/(A)||| hold? 

Again, this statement is false, as the following counterexample shows. Consider 
the concave angle function f{x) = min(a;, 1) = x — {x — 1)+, and the 3x3 
PSD matrices 



0.701816 0.317887 0.198910^ 



B 



0.317887 1.014950 -0.093826 
0.198910 -0.093826 0.274236 



and 



0.192713 

0.446505 
0.455416 



One gets 
while 



||/(A)|U = 0.455416 
+ A) -/(5)|U = 0.455776. 



□ 



Next we consider an even more restricted special case, in which the inequalities 
([3]) and (jlj) finally do hold. We actually prove that a stronger relationship 
holds in this special case. We shall use the notation X^{X) < X^{Y) whenever 
XiiX) < Xi{Y) holds for all k. 

Theorem 1 For a non-negative, increasing concave function g on [0, oo), and 
matrices A,B >0 such that A > ||i?||oo, we have 

X\giA-B))>\\giA)-giB)). (5) 
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An easy corollary is the corresponding statement for non-negative convex func- 
tions. 

Corollary 1 Let f be a non-negative strictly increasing convex function on 
[0,cx)) with /(O) = 0. Let A, B > be such that A > \\B\\^. Then 

XHf{A-B))<\\f{A)-f{B)). (6) 



Proof. Let / = g^^, with g satisfying the conditions of Theorem [H Upon 
replacing A by f{A) and B by f{B), the condition A > ||-B||oo is unharmed 
as / is monotonous. Furthermore, ([5]) becomes 

X\g{fiA)-fiB)))>\\A-B). 

Applying the function / on both sides does not change the ordering, again 
because of monotonicity of /, and yields validity of inequality dS]). □ 

These two results obviously imply the corresponding majorization relations, 
and by Ky Fan dominance, relations in any UI norm. 

Proof of TheoremUi W.l.o.g. we will assume ||-B||oo = 1, since any other value 
can be absorbed in the definition of g. 

It is immediately clear that if ([5]) holds for g that in addition satisfy g{0) = 0, 
then it must also hold without that constraint, i.e. for functions g{x) + c, with 
c > 0. This is because the additional constant c cancels out in the LHS, while 
XH9iA-B) + c)>XHgiA-B)). 

Furthermore, remains valid when replacing g{x) with ag{x), for a > 0. 
Thus, w.l.o.g. we can assume g{0) = and g{l) = 1. Together with concavity 
of g, this implies that, for < x < 1, g{x) > x, while for x > 1, the one- 
sided derivative g'{x) < 1 (since concave functions need not be differentiable 
everywhere, we have to use the one-sided derivative g'{x) = \imt^Q+{g{x + t) — 
g{x))/t). 

Since < i? < I, and for < x < 1, g{x) > x holds, we have g{B) > B, 
or —g{B) < —B. By Weyl monotonicity, this implies X'^{g{A) — g{B)) < 
X'^{g{A)—B). Thus, statement ([5]) would be implied by the stronger statement 

X\g{A)-B)<X\g{A-B)). (7) 



Now note that the argument of g in the LHS satisfies A >I. Thus, in principle. 
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we could replace g{x) in the LHS by another function h{x) defined as 
h{x) -- 



g{x), if X > 1 
X, otherwise. 



If we also do that in the RHS, we get a stronger statement than ([7]). Indeed, 
h{x) < g{x) for X > and A - 5 > 0, and therefore h{A - B) < g{A - B) 
holds. By Weyl monotonicity again, we see that ([7]) is implied by 

X\h{A)- B) <\\h{A- B)). (9) 



The importance of this move is that h{x) is still an increasing and concave 
function (because g'{x) < 1 for a; > 1), but now has h'{x) < 1 for x > 0. 

Defining C = A — B, which is positive semi-definite, we now have to show the 
inequality 

\i{h{C + B)-B)< \i{h{C)) = MAi(C)), 

for every k. Fixing k, and introducing the shorthand Xq = Xj.{C), we can 
exploit concavity of h to bound it from above as h{x) < a{x — xq) + h{xo), 
where a = h'{xo) < 1. Again by Weyl monotonicity, we find 

Xiih{C + B)-B)< Ai(a(C + B - xq) + h{xo) - B) 
= Xl{aC + (a - 1)B - axo + h{xo)) 
< X^{aC) — axo + h{xo) = h{xo), 

where in the second line we could remove the term (a — 1)B because it is nega- 
tive. This being true for all k, we have proved (Q and all previous statements 
that follow from it, including the statement of the theorem. □ 



5 Dominated majorization 



We have already pointed out that inequalities ([I])-® were proven first for op- 
erator convex or operator concave functions, being extended only afterwards 
for ordinary convex/ concave functions. Moreover, the proofs for ordinary con- 
vex/concave functions actually exploited the corresponding results for opera- 
tor convex/concave functions. This may seem somewhat unnatural and it is 
not unreasonable to ask for a more direct proof. 

In this section we introduce a number of new ideas and techniques which, 
although they may seem strange and somewhat contrived at first, will lead 
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to new, elementary proofs of inequalities ([I])-® that bypass the Ando-Zhan 
theorem and do not require the machinery of operator monotone and opera- 
tor convex functions. Secondly, we will use this technique to try and answer 
Question 2 raised in the previous section. 

Let us consider three Hermitian matrices A, B and C and assume that there 
exists ao > such that the following relation holds for all a > ao, and for 
certain (possibly all) values of k: 

k k 

Y,\]{aA + B) <Y,\]{aA + C). (10) 

As it holds for all a > oq, it should be possible to simplify this condition. 
Subtracting Z]^=i Aj(ayl) from both sides, and substituting a = 1/t, we obtain 

- j:{\]{A + tB) - \]{A)) < - jz{\\{A + tC) - Aj(A)), 
^ i=i ^ i=i 

for all < t < to = l/flo- In the limit of positive t going to 0, this yields a 
comparison between directional derivatives of sums of k largest eigenvalues: 



d_ 

di 



k 



j:xjiA+tc). (11) 

t^Q+ j=l 



Let us introduce the vector 6{B] A) defined as: 



j^XjiA + tB). (12) 



j:S,iB;A) :=| 

j=i 



With this notation, relation ([TT|) becomes 

k k 

Y.5,{B-A)<Y.5,{C-A). 

That is, the entries of 5{B] A) are related via a majorization-like relation 
(without the usual rearrangement) to those of 5(C; A). 

To simplify the notations, we will use the symbol for this relation: 

k k 

a ~<u,h ^^^aj <^hj, (13) 
i=i i=i 
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and explicitly put rearrangements in the vectors concerned by use of the sym- 
bols t and J,. In that way, we write the classical majorization relation as 

With these notations relation ( ITT]) is expressed as 

6{B;A) ^^5iC;A). (14) 



We call this relation A-dominated majorization or A-majorization for short. 

Definition 1 Consider three Hermitian matrices A, B and C . When the re- 
lation ( [77]) holds, or equivalently, [14\ ), we say that B is A-majorized by C . 

The argument shown above proves the following: 

Proposition 1 Let A, B and C be Hermitian matrices. If there exists ao > 
such that Y!j=i \\{aA + B) < J2j=i >^]{aA + C) holds for all a > ao, then 
5iB;A) ^^5iC;A). 



5.1 Directional derivative of the sum of the k-th largest eigenvalues 



It turns out that there is a very simple way to calculate 6{B;A), based on 
an explicit expression of the directional derivative of the sum of the k largest 
eigenvalues of a symmetric matrix, which is well-known in numerical analy- 
sis (see [13] and references therein, and ISO])- The directional derivative of a 
convex function is defined as follows ([13], Section 2.2): 

Definition 2 Let f{x) be a convex function defined on a subset O of a Eu- 
clidean space X . For any x E O, and d E X, the directional derivative of f at 
X in the direction d is defined as 

/-(x.d)^l..n^'^ + ""-^W. 



It is essential that the limit t — )■ O"*" is taken because / need not be differen- 



tiable. We will denote this directional derivative by the symbol ^ 



Consider an n x n Hermitian matrix A, and let its eigenvalues, sorted in 
non-increasing order, be denoted by Aj(A), j = l,2,...,n. Let its distinct 
eigenvalues, sorted in decreasing order, be denoted by fii{A), i = 1,2, ... ,m 
(with m the number of distinct eigenvalues) and the corresponding multiplic- 
ities by Tj. Thus J^iLi^i = fT-- The sum of the k largest eigenvalues of A can 
be written in terms of the Hi as follows: writing k a.s k = ri+r2 + ■ ■ ■ + ri + 
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where 1 < s < r/+i, 

k I 

j=i i=i 

Furthermore, let Pi denote the projector onto the i-th eigenspace of A, corre- 
sponding to eigenvalue fii{A). Thus, Pi is a matrix of dimensions x n. The 
spectral decomposition of A can then be written as 

m 

A = Y.y^M)p:p,. 

i=l 



The following is a reformulation of Corollary 3.9 in fT3], which was proven 
there for real symmetric matrices. 

Proposition 2 Let A he a real n x n symmetric matrix with spectral de- 
composition A = fii{A) P* Pi and multiplicities rj. Let B also he a real 
n X n symmetric matrix. With k written as k = ri + r2 + ■ ■ ■ + ri + s , where 
1 < s < r;+i, the directional derivative of J2'j=i ^j{A) in direction B is given 
by 



d_ 

dt 



Y: Xj{A + tB)=Y: Tr PiBP* + J] Aj(P,+ii?P;+,). 



(15) 



t^o+ j=i 



i=l 



Note that, when s = r;+i, this formula simplifies to 



d_ 

di 



j^^jiA + tB) = 'j2Ti P.BP; 

t^0+ j=l i=l 



(16) 



We summarise what we really need to know about this proposition in the 
following theorem (quietly extended to the complex case). 

Theorem 2 Let A and B he Hermitian matrices. With 6{B] A) defined hy 
ffM), the entries of the vector 5 [B] A) are the diagonal entries of B in a certain 
hasis in which A is diagonal and its diagonal entries appear sorted in non- 
increasing order. When all eigenvalues of A are simple (i.e. have multiplicity 
1), this hasis is just the eigenhasis of A and does not depend on B. 

An independent proof of this theorem, that also works for complex Hermitian 
matrices, is presented in Section [71 

The upshot of Theorem [2] is that there exists a unitary matrix U such that 
U*AU = A^{A) and 6{B;A) = I)iag{U*BU). In other words, 6{B;A) is the 
vector of diagonal elements of B, in a particular basis governed by A, and 
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possibly by B too. In the generic case that all \i{A) are distinct, U is unique 
and does not depend on B, hence in that case S{B; A) is the vector of diagonal 
elements of B in the eigenbasis of A. 



5.2 Dominated majorization for co-diagonal matrices 

Let us now specialise to the case where A and B commute and there is a 
common basis in which the diagonal elements of A and B appear in the same, 
non-increasing order. We will say that A and B that satisfy this condition are 
CO- diagonal. 

According to Proposition^ validity of ffTOj) for all a > implies A-majorization, 
f lT4|) . Theorem |2] now immediately leads to the following proposition, which 
says that for co-diagonal A and B, validity of ffTOj) for all a > is actually 
equivalent with A-majorization. 

Proposition 3 For Hermitian A, B, C , where A and B are co-diagonal, the 
following are equivalent: 



(dZD implies (fT8|): 

If relation f lTU]) holds for all a > 0, then it holds for a tending to infinity. By 
Proposition [T] we then get that B is A-majorized by C. 

(HH]) implies ([T^: 

Let us add aX^{A) to both sides of (ITHj) . By Theorem^ 5{B] A) is the vector of 
diagonal elements of 5, in a basis in which A is diagonal and the eigenvalues 
of A appear sorted in non-increasing order. Thus, Va > 0, 5(5; A) + a\^{A) = 
6{B + aA] A). The same holds for C. 

(fT9|) implies (fTTl): 

By the co-diagonality of A and B, a A -\- B is diagonal in any basis in which 
A is diagonal. Hence, the LHS of (fT9!) is equal to \^{aA + B). By Schur's 
majorization theorem, the RHS of (1191) is majorized by X^{aA + C). □ 



X^{aA + B)^^\^{aA + C), Va > 
S{B;A) ^^6{C; A) 
6{aA + B;A)^^5{aA + C;A), Va > 0. 



(17) 
(18) 
(19) 



Proof. 
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6 Applications of dominated majorization 

In this section we first use Proposition [3], to give a new, elementary proof 
of inequality ([T]) for non-negative concave functions (which readily implies 
validity of inequality for non-negative convex functions), that does not 
rely on the Ando-Zhan inequality for operator concave functions, nor on the 
theory of operator monotone functions. 

Then, we answer Question 2 in the negative by exhibiting a counterexample. 
Here, too. Proposition |3] was instrumental. 

6.1 A new proof of inequality (J\) for non-negative concave functions 

We want to prove that 

\\\f{A + B)\\\<\\\f{A) + f{B)\\\ 

holds for all non- negative concave functions f{x). Therefore, it should hold in 
particular for all functions /(x) = b + ax + fo{x) , where /o is non-negative 
concave with /o(0) = and /o(+oo) = 0, and for all a,b > 0. Inserting this 
in the eigenvalue- majorization form of inequality ([T]), we get the majorization 
relation 

X^{b + a{A + B) + fo{A + B)) \^{2b + a{A + fi) + f^{A) + /o(5)), 

for A,B >0. Clearly, this is strongest for 6 = 0. Proposition [3] then immedi- 
ately yields the equivalent form 

5{f{A + B)-A + B)^^ 5{f{A) + f{B)- A + B), 

for all non-negative concave functions / (recall that such functions are non- 
decreasing) with /(O) = 0. 

An interesting aspect of this form is that, unlike A, 5 is linear in its first 
argument. Our proof of the equivalent form, stated as Proposition H] below, 
crucially depends on this property. 

Proposition 4 For positive semidefinite A and B, and f a non-negative con- 
cave function with /(O) = 0, 

5ifiA + By,A + B)^^ 6ifiA) + f{B)- A + B). (20) 

Proof. Any non-negative concave function / can be uniformly approximated as 
a positive linear combination of angle functions x ^ x—{x — 1)_|_. By linearity 
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of S, inequality ( 120|) follows if it holds for any such angle function, i.e. 

6{A + B-{A + B- t)+- A + B)^^ 6{A ~ {A - t)+ + B - {B - t)+- A + B), 

which, again by linearity, simplifies to 

6{{A - t)+ + {B- A + B)^^ 6{{A + B- t)+; A + B). 

In fact, for angle functions the latter inequality even holds with rearrangement, 
and we shall prove 

6\{A - t)+ + {B- A + B)^^ S\{A + B- A + B), 

for all t > 0. Letting tr(x) denote the sum J27=i^i of x = (xi, . . . ,Xn), this 
relation can be expressed in a well-known way as 



ti{5{{A - t)+ + {B- t)+; A + B)-s)+< tr(5((A + B- t)+; A + B)~s] 



+ 5 



for all s (and t > 0). Since both vectors 6 are non-negative it suffices to 
consider the case s > 0. In the eigenbasis of A + B, A + B itself is of course 
diagonal, hence the RHS simplifies to Tt{A + B — {s + 

Now we introduce the variable u = s + t. The last inequality has to be valid 
for all values of s and t, thus if we keep the value of u fixed, the inequality has 
to remain true if we maximise the LHS over all values of t in the range [0,^] 
(and set s = u — t). That is. 



max tr(5((A - t)+ + {B - + ; A + B) - u + t) + 

0<t<u 

<Tt{A + B - u)+. (21) 



The next important consequence of the simple behaviour of 6 is that the 
function t F{t) := tr(5((A - t)+ + {B - + ; A + B) -u + t)+ is convex. 
Note first that the positive part function is convex and increasing. Applying 
this to its outer appearance in the definition of F, the required convexity of 
F{t) follows if, for any i, 6{{A - t)+ + {B - A + B)i - u + t is itself a 
convex function of t. This function can be written as {{A — t)^)ii + {{B — 
t)+)ii — u + t, in the eigenbasis of A + B. Hence, convexity follows from the 
convexity of t t-)- {ijj, {A — t)+ip), for any vector ijj, and to see the latter, just 
consider this quantity in the eigenbasis of A and see that it can be written as 
Yyj=i{^j{A)—t)+\4'j\'^, which is a positive linear combination of angle functions 
and, therefore, convex. 

The convexity of F(t) now implies the simple fact that the maximum in the 
LHS of ([2lD maxo<t<„ tr(5((A - 1)+ + {B - A + B)-u + t)+ is achieved in 
one of the extreme points, either in t = or in t = u. Noting that A and B are 
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positive semidefinite, the value achieved in t = is tT{S{A + B; A + B) — u)jf., 
which is identical to the RHS in (12T!) . It therefore only remains to show that 
the value in t = m is also bounded above by the RHS. Using the fact that the 
function tT6{X; Y) is always equal to Ti X, this amounts to the inequality 

Tt{A - m)+ + Tr{B - m)+ < Tt{A + B- u)+. (22) 



Here, the outer appearance of the positive part function in the LHS has been 
removed because its argument is always positive semidefinite. 

To prove inequality fl2^ . recall the norm inequality 

|||A©S||| < |||(|A| + |S|)©0|||, 

valid for any unitarily invariant norm ([7J, Theorem IV. 2. 13). In particular, 
it holds for the Ky Fan norms, and for PSD A and B can be written as the 
eigenvalue majorization 

\HA®B) \^{{A + B)®0). 

Thus, for all M > (again, by non- negativity of A and B, it suffices to consider 
u > 0), 

' ^ ^ M I < Tr I I \ - u 





which is nothing but inequality (122|) . reformulated in terms of 2 x 2 block 
matrices. This ends the proof of the proposition. □ 

One might still object that our proof is not really elementary, relying as it is on 
Proposition [3] and the theory behind it. Strictly speaking, though. Proposition 
[3] is not needed in the proof, and only provided the intuition to try and prove 
the equivalent form (1201) . Indeed, validity of inequality ([T]) follows immediately 
from Proposition m by combining it with Schur's majorization theorem: 



X\f{A + B)) = 5{fiA + By,A + B) 

^^6{f{A) + f{By,A + B) 

X^{f{A) + f{B))- 



As already shown by Ando and Zhan [2], validity of inequality ([T]) for a given 
non-negative increasing concave function / implies inequality ([2]) for the in- 
verse function g = f~^. Hence, in combination with our proof of inequality ([1]), 
this also yields an elementary proof of inequality ([2]) for non-negative convex 
functions g{x) with g{0) = 0, This was first proven independently from ([T]) 
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by Kosem, by appealing to the corresponding inequality for operator convex 
functions. 

For completeness, we repeat the short Ando-Zhan argument here. 

Proof of inequality ^ for non-negative convex functions. Let g{x) be a non- 
negative convex function with g{0) = 0. Thus g is increasing. In particular, g 
applied to vectors is strongly isotone [7]. 

Let f{x) be its inverse function, / = g~^] thus f{x) is a non-negative increasing 
concave function with /(O) = 0. For such /, we have (inequality ([T])) 

XHf{A + B)) XHf{A) + f{B)). 

Since g{x) is strongly isotone, applying g on both sides preserves weak ma- 
jorization: 

gi\\f{A + B))) g{\\fiA) + f{B))). 
This simplifies, by monotonicity of (7, to 

X^A + B) X^{g{f{A) + f{B))). 

Substituting g{A) for A and g{B) for B then yields inequality ([2]). □ 

6.2 Counterexample to Question\^ 

To answer Question [2|, we will first disregard the absolute values and consider 
the property that a convex function / satisfies 

A(/(A))^^A(/(i? + A)-/(S)) (23) 

for all PSD B and A, which is equivalent to the statement 

\U{A-B))^^\U{A)-f{B)) (24) 

for all A > 5 > 0. 

Although it is by no means obvious at this point, when A > strictly. Question 
[2] is equivalent to validity of fl2^ for all stated functions. While it is obvious 
that ([23D implies |||(7(i? + A) — (7(i?)||| > |||(7(A)|||, the opposite is not neces- 
sarily true because of the absolute value implicit in the definition of the norm. 
Nevertheless, it will turn out that a counterexample to (!23|) for some function 
g will indirectly yield a counterexample to Question [2] for some other function 
g{x) = g{x) + ax, with a > large enough, provided A > holds strictly. 
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Here, a must be large enough to make g(B+A)—g{B) = g{B+A)—g{B)+aA 
positive semidefinite, in which case the absolute value signs can be left out. 
This will all be made clear below. 

The monotone convex angle functions x ax + (x — 1)+ (a > 0) already 
have proven their valour as a testing ground for similar statements, in Section 
[3l Numerical experiments using angle functions for inequality fl23|) did not 
directly lead to any counterexamples, however. This temporarily increased 
our belief that the inequality might actually hold, and led us to investigate, 
as an initial step towards a 'proof, whether the inequality 

k k 

\\{aY + B)<Y, X]{aY + C) 



might be true for all a > 0, where 5 = /(F) and C = /(X + F) - /(X), and 
/(x) = (x - 1) + . 



If the answer to Question [2] is to be affirmative, it should at least hold for all 
angle functions /(x) = ax + 6(x — xo)+. By Proposition [3] this is equivalent to 
the statement 

5{{Y - I)+; Y) 5{{X + F - I)+ - (X - I) + ; Y). 

Consider the 3x3 PSD matrices 

^ 0.35614 -0.053243 0.10116^ 



X 



-0.053243 0.87456 0.40559 
0.10116 0.40559 0.82474 



and 



Y 



0.53642 

0.42018 
0.094866 



The eigenbasis of Y is therefore the standard basis. Then 6{(Y — F) 
(0, 0, 0) and 



^-0.00018194 0.00052449 -0.0016345^ 



(x + r-i)+-(x-i) 



V 



0.00052449 0.2573 0.12368 
-0.0016345 0.12368 0.04 



so that 5{{X + F - 1)+ - (X - 1)+; Y) = (-0.00018194, 0.2573, 0.04). The first 
entry is negative, violating the majorization relation. 
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Now, as mentioned above, this counterexample immediately yields a coun- 
terexample to Question [21 Consider thereto the function /(x) = ax + {x — 1)+ 
with a = 1, say. Then the LHS of the inequality becomes 5{Y + {Y — V)^\Y) = 
(0.53642, 0.42018, 0.094866) and the RHS 5{Y + {X + Y {X - l)+;Y) = 

(0.53624, 0.67748, 0.13487), again violating the inequality. Since Y+{X + Y- 
I)+ — (X — I)+ is a positive definite matrix (as can be checked numerically), 
it is unchanged by putting in the required absolute value signs. 

Even more explicitly, consider the function g{x) = lOlx + {x — 1)+. Then 
X^{g{X + Y) - giX)) = (54.17824,42.69595,9.621004) while X^ig{Y)) = 
(54.17842,42.43818,9.581466). This clearly violates the eigenvalue majoriza- 
tion relation of Question |2l with absolute value signs, because of the positivity 
oig{X + Y)-giX). 



7 Proof of Theorem [2] 

In this section, we give a self-contained proof of Theorem [2] that does not rely 
on the methods of convex analysis and is also valid for complex Hermitian 
matrices, not only real-symmetric ones. For convenience, we reformulate the 
statement of the theorem here. 

Define a proper eigenbasis of a Hermitian matrix A as an orthonormal basis 
in which A is diagonal and its diagonal entries are the eigenvalues of A sorted 
in non-increasing order. 

Theorem 2'. Let A and B be Hermitian matrices. With 6{B] A) defined via 
equation f lT^ . the entries of the vector 6{B] A) are the diagonal entries of 
B in some proper eigenbasis of A. When all eigenvalues of A are simple (i.e. 
have multiplicity 1), this proper eigenbasis is unique; otherwise the required 
one depends on B. 

We need a number of definitions first, and recall some basic facts about the 
perturbation theory of eigenvalue decompositions (see, e.g. [H], Chapter 2, 
Section 1). 

Consider the matrix- valued function z ^ A + zB, z E C, with A and B 
the n X n Hermitian matrices of the theorem. It is well-known that the roots 
of the characteristic function of A + zB are analytic functions of z with only 
algebraic singularities. This means that the number m of (distinct) eigenvalues 
oi A + zB is a constant of z, with the exception of a number of special values 
of which will be called exceptional points. If m < n, we say that A + zB 
is permanently degenerate. In the exceptional points some of the eigenvalues 
may coincide; this is called an accidental degeneracy. 
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In the following we will consider a simply-connected subdomain D of the com- 
plex plane C containing no exceptional points, and such that the intersection 
of D with the real axis is the interval (Oyto), with to > 0. The closure of D is 
denoted D, and its intersection with the real axis is [0,to]- 

We can write the (possibly multiple) eigenvalues of A + zB, z E D, as holo- 
morphic functions Xi{z), X2{z), . . . , Xn{z). For z = t eM, these eigenvalues are 
real and can be sorted. Sorted in non-increasing order they will be denoted as 

xi{t)>xi{t)>...>xi{t),o<t<to. 

Furthermore, we can write the distinct eigenvalues of A + zB, z E D, as a 
fixed number of holomorphic functions fii{z) , fi2{z) , ■ ■ ■ ,fJ'm{z)- We will num- 
ber them such that > /^2(^) > ••• > fim(t) holds for t G (0,to) (or 
t G [0,to))- We denote the multiplicity of /ij by rj. Thus, n = X^I^i'^i- 

The projector on the eigenspace of A + zB corresponding to /ij(z) will be 
denoted by the function Vi{z), z E D, and is called the eigenprojection for 
fii{z). This function is holomorphic on D |14j . 

If 2; = is not an exceptional point, the distinct eigenvalues of A are equal 
to the limiting values /Uj(0), and the corresponding eigenprojections coincide 
with the Pi(0). 

If z = is an exceptional point then an accidental degeneracy occurs and 
A has less than m distinct eigenvalues. Each of these eigenvalues may split 
into several ^iit); that is, limt^o fii{t) = X for several (contiguous) values of 
i, say i = ii, . . . ,i2, where A is a certain eigenvalue of A. In that case, the 
eigenprojection for A of A coincides with the sum \imt^Qj2f=i^'Pj(t)i i-e. the 
limt^Q Vj{t) separately are not themselves eigenprojectors of A. 

Let k be an integer such that there exists an / for which k = ri + r2 + . . . + ri; 
we shall say that such a /c is an entire sum of the multiplicities rj. For such 
values of k, we define the projector V[k){.z) as the sum of eigenprojectors 

'P{k){z) = Viiz) + V2{z) + ...+ Vi{z). 

For z = t real, this is the projector on the subspace spanned by the eigenvectors 
of the k largest eigenvalues (counting multiplicities) of A + tB. Since the Vi{z) 
are holomorphic functions on D, so is V(k){z). By continuity of the eigenvalues 
Xk{z)^ we have for any such /c, 

k k 

Y: Xi{A) = hm Y Xiit) = Tr[^hm P(,)(t) A]. 

j=0 j=0 

If k cannot be written in this way, i.e. k = ri + r2 + . . . + ri + s with s 



21 



a 'remainder' satisfying < s < r^+i, we cannot uniquely define V[k)iz), 
because there is an infinity of s-dimensional subspaces in the eigenspace for 
Hi+i- Hence, we will only define 'P{k){z) for k that are entire sums of rj. 

Finally, we define the projectors Vi^k)- If 2; = is not an exceptional point, and 
k is an entire sum of multiplicities r^, then 'P(fc)(0) is defined, and we define 
^(fc) '■= ^(fc)(0). If -2 = is an exceptional point then A + zB has an accidental 
degeneracy at 2; = 0. Even if k is an entire sum of multiplicities r , of /I + zB, 
it need not be an entire sum of multiplicities of A. Hence, in that case V(k) (t) 
is only defined for t € (0, to) (with to > 0) but not for t — 0. We will then 
define V[k) as the limiting value 

For all other values of k, V{k) will not be defined. 



Lemma 1 If k is such that V(k) is defined ( directly in t — or via the limit 
t^0+), then 

k 

Y,6,{B-A) = TrBVik). 

Proof. Consider the variational characterization of the sum of the k largest 
eigenvalues of a Hermitian matrix H: 

Y,X]{H) = maxTT[HQ], 
j=i ^ 

where Q runs over all rank-A; projectors. If k is such that V(k){H) exists (taking 
the potential degeneracies of H into account) then Q — V{k) (H) achieves the 
maximum, i.e. ma,XQTr[H Q] — Tj:[H V(^k){H)]. 

We have, in particular, that V(k){t) '■= 'P{k){A + tB) (if it exists) achieves the 
maximum for H — A + tB. More precisely, for any t in the open interval 
(0, to), the function u i-)- Tr[{A + tB)V(^k){u)] achieves its maximum over (0, to) 
in the interior point u — t. Since V{k){t) is holomorphic, this function is 
differentiable, hence this maximum must be a stationary point. Thus 

TrP + t5)P(,)(«)] = 0, 

i.e. 

TrP + tS) ^P(,)(t)]=0. 

This imphes 
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d_ 

dt 



d 



Y^X]{t) = -TT[{A + tB)V^,){t)] 



d 



:Tr[fiP(,)(t)]. 



In particular, 



k Q 

Y.5,{A;B) 



dt 



j^Xjit) = lim Tr[EP(,)(t)] = Tr[SP(,)]. 

t^o+ j=i 



□ 



We are now in the position to prove Theorem O Let's first consider the simplest 
case when A is not degenerate, i.e. all eigenvalues of A + zB are simple for z G 
D. In that case V[k){.z) is always defined for all k and all z E D, and, hence, V(k) 
is defined as 'P(k){0) = Y.j=i'Pj{0)- There is a unique unitary matrix U such 
that UAU* = Diag(A~'-(A)), and in this basis the projector Vj is expressed as 
e^^. Hence, by the lemma we have that Z]j=i Sj{B; A) = Tr BV^k) = J2^=i ^jj^ 
where the Bjj are the diagonal elements of B expressed in that same basis. 
Therefore, for all j, Sj{B] A) = Bjj. 

If A is degenerate, there is no unique eigenbasis of A. However, the lemma 
only requires us to deal with the limits \imt_^Q+ V[k){t)- If the degeneracy of A 
is lifted completely in A + zB, i.e. all eigenvalues of A split into simple eigen- 
values, then all Vj{t) are rank-1 projectors and T'(k)i't) = I]j=i 'Pjii)- Further- 
more, letting ii and ^2 be any pair of indices such that an eigenvalue of A splits 
into the eigenvalues fii^^, . . . , fii^ of A + zB, we have that limt_j.o Ylf=ii '^jif) is 
an eigenprojector of A. Therefore, there exists a unique proper eigenbasis 
of A (determined by B) in which \im.t^QVj{t) = the elementary matrix 
with a 1 in position (j, j) and zeroes elsewhere. Again we find that, for all j, 
8j[B] A) = Bjj in that proper eigenbasis. 

The most complicated case arises when A + zB is permanently degenerate, i.e. 
the degeneracies are not lifted completely, as some eigenvalues of A may split 
into still degenerate eigenvalues /ij of A + zB, with multiplicities r,. Then the 
projectors Vi{z) have rank r,, and the V(k){t) are only defined when k is an 
entire sum of the multiplicities r^. There still exists a proper eigenbasis of A 
in which the projectors limt^o '^ji't) diagonal, now of the form © I^.. © 0, 
but it is no longer unique; we will exploit exactly this freedom to deal with k 
that are not entire sums. 

If k is not an entire sum of rj, we have k = ri + r2 + . . . + ri + s, with s the 
remainder term, satisfying 1 < s < r^+i. We first write k as an interpolated 
value between two entire sums as follows: 
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k = (ri + r2 + . . . + r;+i) + (1 )(ri + + . . . + r^) 

= q;/c+ + (1 — a;)A;_. 

Here we defined a = s/r;+i, and the two entire sums k_ = ri + r2 + . . . + 
and k+ = ri + r2 + . . . + r;+i. We can express Z]j=i -^j as a linear interpolation 

between Et=i -^t ^^"^ T.'jti ^i- 



k I 



j=i i=i 

= Ty[{A + tB) {Vi{t) + ... + Vi{t) + -^Vi+^m 

ri+i 

= Ti[{A + tB) + (1 - 

= a Tt[{A + tB) P(fc+)(t)] + (!-«) Tt[{A + t5) 

Applying the Lemma to both terms, we obtain 

k I 

Y.S,{B;A) = Tr[fi(aP(,^) + (1 - = ^ Tr + a Tr fiP,+i.(25) 

j=i i=i 



Again, to deal with eigenvalue splitting at 2; = 0, each of the Vi corresponds 
to the limit limt_^o+ "Piit)- 

Let us consider a partitioning of B in an eigenbasis of A + zB mentioned 
before, in which the Vi{z) appear in the form © © 0. That is, in B we can 
single out blocks on its diagonal, each of which corresponds to an eigenspace 
of A + zB; Then Tt BVi{z) is the sum of all diagonal elements of the z-th 
block of B. 

The degeneracy of the eigenvalues fii{z) means that this eigenbasis is still not 
unique and is determined up to 'local' rotations within each of the eigenspaces. 
We can use this freedom to make the diagonal elements of B equal within each 
block. This allows us to get rid of a in ( l25l) . Indeed, as a = s/r^+i and Tr BVi+i 
is the sum of all r/+i diagonal elements of the (/ + l)-th block of B, then if 
all these diagonal elements are equal, a Tr BVi+i{z) is equal to the sum of the 
first s diagonal elements of B in that block. 

Wrapping up we find that Y.i=i Tr BVi + a Tr BVi+i equals the sum of the 
first Ti + r2 + ■ ■ ■ + ri + s = k diagonal elements of B in the chosen eigenbasis. 
Taking the limit 2; = t — )■ 0, we finally obtain that, again, there is a proper 
eigenbasis of A in which 

k k 

Y^6,{B;A) = J2B,„ 
i=i i=i 
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and hence 5j{B;A) — Bjj. □ 
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